The Structure of the Inverse to the Sylvester 

Resultant Matrix 



Boris D. Lubachevsky 

hdl@hell-lahs.com 

Bell Laboratories 
600 Mountain Avenue 
Murray Hill, New Jersey 



Abstract 

Given polynomials a(A) of degree m and h{\) of degree n, we represent the inverse 
to the Sylvester resultant matrix of a and 6, if this inverse exists, as a canonical sum 
of m + 71 dyadic matrices each of which is a rational function of zeros of a and h. As 
a result, we obtain the polynomial solutions x{\) of degree n — 1 and y(A) of degree 
m — 1 to the equation a{\)x{X) + b{X)y{X) = c(A), where c(A) is a given polynomial 
of degree m + n — 1, as follows: x(A) is the Lagrange interpolation polynomial for the 
function c(A)/a(A) over the set of zeros of 6(A) and y{X) is the one for the function 
c(A)/5(A) over the set of zeros of a(A) . 
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A method to solve the equation 

ax + by = c, (1) 
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where a = a(A), b = 6(A), and c = c(A) are given, x = 
univariable polynomials of degrees m,n,m + n — l,n — l, 
the system of m + n linear algebraic equations 



zS 



x(A), and y = y{X) are unknown 
and m — 1, respectively, is to solve 



(2) 



where z = z{x,y) and d = d{c) are m + n-dimensional row-vectors composed of the 
coefficients of polynomials x, y and c, respectively, S = S{a, b) is the (m + n) x (m + n) 
Sylvester resultant matrix of polynomials a and b. Solving equation (|I|) constitutes an 
important single-input /single-output case in the zero placement procedure in control theory 
(see, e.g., @). In this paper we explicitly represent the matrix S^^ as a canonical sum of 
m + n dyadic matrices each of which is a rational function of zeros of a and b; thus we give 
an explicit solution {x{X),y{X)) for (|l|). This solution may have a practical application in 
certain situations of adaptive control. Although the formulation of this solution appears 
very simple (see Corollaries 1 and 3), the author is able to mention no other work with this 
solution. 

In the case in which neither a(A) nor 6(A) have multiple zeros, the representation of 
is much simpler than in the general case. First, we formulate the results for this special case. 

Theorem 1. 

Let a(A) = aoA™" + aiA™~^ + ... + = ao(A — a;i)(A — a2)---(A — am) 

and b{X) = 6oA" + feiA""^ + ... + fe„ = 6o(A - /3i)(A - /32)...(A - f3n) 

be complex polynomials, a^bo ^ 0, and let S = S{a, b) be their Sylvester matrix, 



S{a, b) 
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n + m columns — *• 

If all zeros ai are simple (i.e. pairwise different), and all zeros Pi are simple, then the adjoint 
matrix adj S is 
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adj S{a, b) — 



m+n—l 
m+n—2 



n 


b{ar) 


1 < r < m 


r ^ i 




a{X) 




A — ctj 


\=ai 



0, 0, row„ 



a(A) 



A 



+ 



0£i 



(3) 



m+n—l 
m+n—2 



n 




1 < s < n 






6(A) 









row„(T^),0,...,0 



where rowfc(p(A)) denotes row [po,Pi, ■■■Pfe-i] composed of the coefficients of the polynomial 
p{X) = poA'^-^ +PiA'=-2 + ... 

Corollary 1. 

/n the assumptions of Theorem 1, if dctS{a,b) ^ 0, then 
S{a,b)-^ = 



= E 



i=l 



m+n—l 
m+n—2 



b{ai) 



a{X) 



X — ai 



O,...,0,row„(-^) 
A - a,; 



+ 



A=ai . 



(4) 



+ E 



m+n—l 
m+n—2 



a{(3j) 



X-(5, 



row„(T^),0,...,0 
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and the solution to equation (|T]) is given by the formulas 

A - (3, ci/3,) 

3 



6(A) 




A-/?, 


A=/3, 



a(A) 



a(A) 




A — Q;j 





(5) 



i.e., a;(A) is the Lagrange interpolation polynomial for the function c(A)/a(A) over the set of 
zeros of b{X) and y{X) is the one for the function c(A)/6(A) over the set of zeros of a{X) . 

The following corollary describes the asymptotic structure of S~^, x(A), and y{X) when 
a zero of a (A) approaches a zero of b{X). 



Corollary 2. 

Let the coefficients of the polynomials a(A) = ar{X) and 6(A) = 6r(A) depend on a parameter 
T, T E {t}. Denote by Xr{X) and ?/r(A) the corresponding solutions of (|l|). Let r^, be 
an accumulation point in {r}, and assume that for all t ^ the zeros ai = aj(r) and 
f3j = l3j{T) are simple and that for t ^ we have 



a^(A) ^ a^A), ai{T) a* , i = l,...,m, 6^(A) ^ 6^(A), /?j(r) ^ j = 1, n. 



and a* ^ Pj* for all pairs {i,j), different from {i = 1, j = 1) (zeros of a^:{X) and 6* (A) are 
not necessarily simple), then 



lim iaiiT)-Mr))S{ar,br)-' 



Qm+n—1 
Qm+n—2 



lim (ai(r) - /?i(r))x^(A) 
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a^,(A)6*(A) 




(x-ey 




0(6) 





6*(A) .a*(A) " 

-row„(^— ^), row^(^j— ^) 



a*(A)6*(A) 



lim (ai(r) - /3i(r))?/^(A) 



{x-ey 


A= 


c{0) 




a*(A)6^,(A) 




{x-ey 


X=9 



6. (A) 
A-^ 



a*(A) 
A-^ 
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In the case of simple zeros the representation of relates to the Lagrange interpolation 
formula. In the general case the representation of S^^ shall relate to the general Hermite 
polynomial interpolation formula. A version of this formula is presented below for reference. 

Let 

a(A) = ao(A - - 02)"^^.. (A - as)""' (6) 

be a complex polynomial of degree m = 7711 + 1712 + ■.. + ms, with zeros ai, as, rrik > 0,k = 
1, s, ao 7^ 0, Ofc 7^ ai for k ^ i. Let /(A) be a complex function which for k = 1,2, s 
is defined at A = and has nik — 1 successive derivatives /'^'(afc), /^^'(afc), /^'"''"^^(aA;)- 
The polynomial p(A) of degree no larger than m — 1 for interpolation /(A) is as follows. 

nil — 1 7n2 — 1 nis — 1 

p(A)= E /''i(«iKi(A)+ E /t^i(«2K.(A) + ...+ E fHc^s)usM 

i=0 i=0 i=0 

where Uk,i{X), i = 0,1, — 1, k = 1,2, s, are m polynomials of degrees no larger than 
m — 1. For argument values A = at, k = 1, ...,s, the values of the polynomial p{X) and its 
nik — 1 successive derivatives are the same as those of the function /(A). 

The polynomials independent of / and are uniquely defined given the zeros of the 

polynomial a(A) in (|]) with their multiphcities. They are called the /undamenta/ polynomials. 
For any particular set {mi, 777.2, ...777^} explicit expressions for the fundamental polynomials 
can be derived P, In the statement of Theorem 2 below we assume availability of the 
fundamental polynomials Uk^i corresponding to the polynomial a as in (P). 

Similarly, 77 fundamental polynomials Vf j{X) of degree no larger than n — 1 correspond 
to a complex polynomial 6(A) of degree 77 as follows. If 

h{\) = boix - Pinx - (32r:.i>^ - f3tr, (7) 

the zeros of /?(A) are Pi, Pt, ni > 0, i = 1, t. Pi ^ Pj for i ^ j, then 

771 — 1 772 — 1 77t — 1 

PiX)= E f^'\Pi)vi,iX) + E f'\P2)v2,W + -+ E /'^'(AK,(A) 

J = J = J = 

is the corresponding interpolation polynomial. 

We introduce the notation Afc(A) for the following A-column of height k 

^k-i - 

\k-2 



X 

1 



A, 



def 
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li k = m + n, the subscript can be omitted, A(A) =^ Am+nW- Theorem 1, in particular, 
imphes that in the case of simple zeros each column of S^^ is a linear combination of the 
columns obtained as the values of A(A) when A takes on the values of the zeros of the 
polynomials a and b. The following Theorem 2, in particular, asserts that in the general 
case, each column of S*"^ is a linear combination of the columns obtained as the values of 
A(A) and its derivatives 



aW(A) 



{m + n- l)A™+"-2 
(m + n-2)A'"+"-3 



2A 
1 




A[2](A) 



{m + n - l){m + n - 2)A™+"-3 
(m + n - 2)(m + n - S)X^+^-^ 
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when A takes on the values of the zeros of the polynomials a and b. 
Theorem 2. 

Let a(A) be a complex polynomial of degree m as in (^) and let Uk,i{X), k = l,...,s, i = 
l,...,mfc, be them corresponding fundamental polynomials. Letb{X) be a complex polynomial 
of degree n as in (^, and let Vij{X), i = l,...,t, j = l,...,ne, be the n corresponding 
fundamental polynomials. Let S = S{a, b) be the {m + n) x (m + n) Sylvester matrix defined 
as in Theorem 1. 

I. If detS 7^ 0, then there exist m polynomials Uk^i{X), k = 1, s, i = 1, m^, of degree 
no larger than m — 1, and n polynomials Vej{X), £ = 1, ...,t, j = 1, ne, of degree no larger 
than n — 1, such that 

S{a,b)-^ = 



mi —1 



rris — l 



J2 AM(ai)[0,...,0,row„(f/i,,(A))] + ...+ ^ A[*1(«,) [0, 0, row„(f/,,,(A))] 



i=0 



i=0 



(8) 



rii — 1 



nt-1 



+ J2 [row„(Vl,,(A))0, 0] + ... + Y1 ^''^(/^*) [row„(\4,,(A))0, 0] . 

i=o j=o 

The polynomials Uk,i and Vej, which satisfy (|]) and the restrictions on their degrees, are 
unique. 

II. For each fixed k = 1, 2, s the m^ polynomials Uk^i, i = 0,1, ...,mk-i, can be found 
successively, beginning with i = m^-i, by the following recurrence: 



b{ak) 



mt — 1 



.(A) - E 



n=i+l 



2 = mfc - l,mfc - 2, 1,0. (9) 
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For each fixed £ = 1, 2, t the ni polynomials Vej, j = 0, 1, — 1, can be found succes- 
sively, beginning with j = ne — 1, by the following recurrence: 



a{(3e 



3 \n[h-3] 



ii=i+i 



j = ne- l,n(, - 2, 1,0. 



Example. 

Let ttfc be a zero of multiplicity = 2 of a polynomial a(A). In the decomposition (j^) 
two dyadic summand matrices correspond to this zero, A(afc) [0, 0, roWm(f^fc,o('^))] and 
A[^l(afc)[0, 0, roWm(t^A:,i(A))]. We will now derive an exphcit expression for Ukfi and Uk^i 
using (H). Denote dk{\) = a(A)/(A — a^y. The corresponding fundamental polynomials are 
(see i): 



afc(A) a™(«fe)(^ 
Wfc,ol^J — ^7 — rl-L 



«A:,l(A) 



afc(A)(A - ak) 



Following the procedure in Theorem 2, we begin with the value i = — 1 = 1 in (|^) and 
find 

TT (\\ 1 (\\ afc(A)(A-afc) 



6(afc) 



ak{ak)b{ak) 



mi. — l 



Note that the value of the sum in (|^) is assumed to be in the above calculation, since 

h=i+l 

ruk — 1 < i + 1. Next, for i = rrik — 2 = we calculate 

1 



Uk,oW 



Kak 



-[uk,o{X)-b^'\ak)Uk,iW] 



afc(A) 



b{ak)ak{ak) 



1 - (A - ak) 



aW(afc) bW{ak) 



b{ak) 



(10) 



Corollary 3. 

If detS{a,b) ^ then Equation (|l]) has the following solution: x{X) is the general Hermite 
interpolation polynomial for the function c(A)/a(A) over the set of zeros (with their multi- 
plicities) ofb{X), and similarly y{\) is the interpolation polynomial for the function c(A)/6(A) 
over the set of zeros (with their multiplicities) ofa{X). 
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Discussion. 

If zeros of a and h are simple, Theorem 1 represents adjS" in the form (^) . This representation 
is valid for both cases, detS" = and detS* 7^ 0. A representation for adjS" can be obtained if 
zeros are not simple if we multiply both sides of Equation (H) by detS*. This representation 
is of a form similar to that in (^: it is a linear combination of dyadic matrices. The columns 
in the dyads are the same as in The rows are produced by the operation "row" from 
certain polynomials, which are equal to Uk,i and Vij up to a proportionality constant. These 
"row" polynomials are unique given the limitation on their degree since Uk,i and Vi^j are 
unique polynomials. However this general representation is not necessarily valid in the case 
detS* = 0. If zeros are simple, then the zero terms a{(3j) and in the denominators are 
canceled by the corresponding terms in detS" = Hili ^(c^j) = 11^=1 '^(/^i) which appear in the 
numerators. If zeros are not simple detS" does not necessarily cancel all zero terms in the 
denominators. 

In the example above it may be that the coefficients of 6(A) depend on a parameter r, 
r G {r}, 6(A) = 6r(A), the set {r} has an accumulation point r*, and 6t(«A:) when 
r r*, but hr{ak) 7^ for all t ^ and |6r^(afc)| > const > for all r G {r}. 

Because of the presence of the term — — — in expression ([T0|), Uk o(A)det5' retains b(ak) in 

o{ak) ^ 

the denominator. This, in turn, entails that when r —>■ r*, the summand with A{ak) tends 
to infinity even after multiplication by detS*. 

Because of the uniqueness of Uk^i and Vij in (H), we can be assured that, in general, no 
expression for adjS* of the form similar to (H) exists which is valid for both the cases det^* 7^ 
and detS* = 0. 



An application. 

Solving the system of linear algebraic equations (|]) is easier than finding the zeros of the 
polynomials a(A) and 6(A). Therefore formulas (|D(5) are inefficient for solving an isolated 
instance of zero placement equation (|1|)(1). 

However, in the case in which the coefficients of a(A) and 6(A) are continuously chang- 
ing with time r, formulas (|^) appear more attractive. Instead of solving many instances of 
system (|^), one may track zeros of a(A) and 6(A) as they change and substitute these zeros 
into formulas (^ or into their corresponding generalization. This situation may arise if one 
uses the zero placement procedure (|l|) in the course of adaptive control with adjustable and 
time- variable a(A) = a,- (A) and 6(A) = 6,- (A). 

Proofs. 

A proof is only required for Theorem 2 and Corollary 3. Whereas it is easy to establish 
Corollary 3, given the result of Theorem 2, one can also establish it independently of Theorem 
2, if one computes /(A) = a(A)x(A) + 6(A)?/(A) and the appropriate number of derivatives of 
/(A) for each zero of a and 6. 
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Proof of Theorem 2. 

The notation T0Wk{p{X)) can be extended in an obvious way to the case of a column poly- 
nomial p of degree no larger than — 1, in which case rowfe(p(A)) is a. k x k matrix. We 
have 

v4(rowfc(p(A))) = TOWk{Ap{X)) (11) 



rowfc(j9(A)) + rowfe(g(A)) = rowfc(p(A) + g(A)) (12) 

for a k X k complex matrix A and column A-polynomials p{X) and g(A) of degree no larger 
than k — 1 and height k. 

Let Q be the (m + n) x [m + n) matrix in the right-hand side of Equation (|^). Applying 
rules ([Tl|) and (|12D we obtain 

Q = roWm+„(g(A)), 



g(A) = gi(A) + g2(A), 



where q{X) , qi{X) , q2{X) are column A-polynomials of degree no larger than m + n — 1 and 
height m + n, and the polynomials qi and q2 are given by 



mi. -I 



t ni — 1 



^i(^) = E E A[^l(«.)t^M(A), g2(A) = X"'J2I1 Ayi(/?,)V,,,(A). 

fc=l i=0 £=1 j=0 

The (m + n) x (m + n) identity matrix Im+n can be represented as Im+n = i'0Wm+n(Am+n(A)). 
The condition SQ = Im+n can be rewritten as 

Sq{X) = A(A). 

Using the specific structure of the Sylvester matrix S, we have 



(A) = 

We can also calculate the corresponding r-th derivative 



A„(A)a(A) 
A™(A)6(A) 



^AM(A) = (5A(A))[ 



r 

E 

i = 
r 

E 

i = 



A[:l(A)aI'^-l(A) 
a!^(A)6['--1(A) 



Since a{ak) = aW(afc) = ... = a'™* ^'(ctfc) = 0, k= l,...,s, we have 





^AM| 



r 

E 

i = 



A|^(a,)6[^'-l(«^) 



r = 0, 1, ...,mfc - 1. 
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Summing all terms 5'AM(afc)f/fc_r(A), r = 0, 1, ...,mfc — 1, A; = 1, ...,s, we obtain 



where 



s rn^ — 1 r 

MA) = E E E 

/c=l r=0 i=0 







s rrifc-l mfc-1 

= E E '^S(«') E 

k=l 1=0 r=i 

According to (|^) the following identity holds: 



(13) 



(14) 



The sum in the right-hand side of Equation (|1^) is the same as the internal sum in (pTSl). 
Thus, h{\) = Am(A), according to the general polynomial interpolation formula and 



Sqi{X) 



Similarly the identity 



Sq2i\) 





A^(A) 



A'"A„(A) 




can be established. Hence representation (H) is proven. 

The uniqueness of polynomials Uk^i and Vij easily follows from the non-singularity of the 
{n + m) X {n + m) generalized Vandermonde matrix |Q 

[A(«0,...,A[--y(aO,---,A(a.),...,A[-»-i](a,),A(/?0,...,A["-^](/30,...,A(A),...,^ 

when ttfc 7^ for all pairs /c, £, k = 1, s, i = 1, ...,t. □ 
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